Enhancing Contextualised Language Models with Static Character and Word Embeddings for Emotional Intensity and Sentiment Strength Detection in Arabic Tweets
نویسندگان
چکیده
Many studies have focused on Arabic sentiment or emotion classification tasks. However, research alternative aspects of affect, such as emotional intensity and strength tasks, has been somewhat limited. In this paper, we propose a method for enriching contextualised language model that incorporates static character word embeddings in tweets. We examine the assumption models using are trained specifically corpus containing extensive affect-related words can boost performance models. Through development character-level embeddings, found our is able to overcome limitations associated with out-of-vocabulary words, which common problem when dealing informal text. Given this, developed achieves state-of-the-art results detection social media.
منابع مشابه
Word Embeddings and Convolutional Neural Network for Arabic Sentiment Classification
With the development and the advancement of social networks, forums, blogs and online sales, a growing number of Arabs are expressing their opinions on the web. In this paper, a scheme of Arabic sentiment classification, which evaluates and detects the sentiment polarity from Arabic reviews and Arabic social media, is studied. We investigated in several architectures to build a quality neural w...
متن کاملLearning Bilingual Sentiment Word Embeddings for Cross-language Sentiment Classification
The sentiment classification performance relies on high-quality sentiment resources. However, these resources are imbalanced in different languages. Cross-language sentiment classification (CLSC) can leverage the rich resources in one language (source language) for sentiment classification in a resource-scarce language (target language). Bilingual embeddings could eliminate the semantic gap bet...
متن کاملSentiment Intensity Ranking among Adjectives Using Sentiment Bearing Word Embeddings
Identification of intensity ordering among polar (positive or negative) words which have the same semantics can lead to a finegrained sentiment analysis. For example, master, seasoned and familiar point to different intensity levels, though they all convey the same meaning (semantics), i.e., expertise: having a good knowledge of. In this paper, we propose a semisupervised technique that uses se...
متن کاملASTD: Arabic Sentiment Tweets Dataset
This paper introduces ASTD, an Arabic social sentiment analysis dataset gathered from Twitter. It consists of about 10,000 tweets which are classified as objective, subjective positive, subjective negative, and subjective mixed. We present the properties and the statistics of the dataset, and run experiments using standard partitioning of the dataset. Our experiments provide benchmark results f...
متن کاملLanguage Models with GloVe Word Embeddings
In this work we present a step-by-step implementation of training a Language Model (LM) , using Recurrent Neural Network (RNN) and pre-trained GloVe word embeddings, introduced by Pennigton et al. in [1]. The implementation is following the general idea of training RNNs for LM tasks presented in [2] , but is rather using Gated Recurrent Unit (GRU) [3] for a memory cell, and not the more commonl...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Procedia Computer Science
سال: 2021
ISSN: ['1877-0509']
DOI: https://doi.org/10.1016/j.procs.2021.05.089